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Decoding Error-Correcting Codes Based on Finite Geometries 
Field of the Invention 

[01] The present invention relates generally to the field of error-correcting 
codes for data storage and data transmission, and more particularly to a decoding 
method for a class of error-correcting codes based on finite geometries. 

Background of the Invention 

[02] A fundamental problem in the field of data storage and communication is 
the development of practical decoding methods for error-correcting codes (ECC), 
see Blahut, "Algebraic Codes for Data Transmission" Cambridge University 
Press: Cambridge, 2003, and Mac Williams et al., "The Theory of Error-Correcting 
Codes," North-Holland: New York, NY, 1977. 

Binary Linear Block Error-Correcting Codes 

[03] Any references to "codes" herein specifically mean binary linear block 
error-correcting codes. The basic idea behind these codes is to encode messages of 
k information bits using blocks of N bits, where N > k. The additional N -k bits are 
used to decode and correct corrupted messages. A block of N bits is also 
sometimes called a code-word, or simply a "word." 

[04] Corruption can be caused by failure in the storage media or noise in the 
transmission channel. Herein, it is assumed that the transmission channel can be 
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modeled as a binary symmetric channel (BSC). In a binary symmetric channel, 
each bit can be independently flipped with some probability p. 

[05] A code C is defined by set of 2* possible blocks of "code-words" having 
a block length N. k is sometimes called the "dimension" of the code. The "rate" R 
of a code is defined by R = k/N. Codes are normally much more effective when N 
and k are large. However, as the size of the parameters N and k increases, so does 
the difficulty of decoding corrupted messages. 

[06] The Hamming distance between two words is defined as the number of 
bits that differ in the two words. The distance d of a code is defined as the 
minimum Hamming distance between all pairs of code-words in the code. Codes 
with a larger value of d have a greater error-correcting capability. Codes with 
parameters N and k are referred to as (N, k) codes. If the distance d is also known, 
then they are referred to as (N, k y d) codes. 

[07] The class of binary linear block error-correcting codes includes a great 
variety of sub-classes of codes. Some of the most notable of these sub-classes are 
the low-density parity check (LDPC) codes; Bose, Chaudhuri, and Hocquenghen 
(BCH) codes; and codes based on finite geometries, such as Reed-Muller codes. 

[08] Reed-Muller Codes and other Codes Based on Finite Geometries 

[09] One important class of error-correcting codes are the Reed-Muller codes, 
described by Reed et al., in "A Class of Multiple-Error-Correcting Codes and the 
Decoding Scheme" IRE Trans. Inform. Theory, vol. 4, pp. 38-49, Sept. 1954, and 
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Muller, "Application of Boolean Algebra to Switching Circuit Design to Error 
Detection," IRE Trans. Electron. Comput., vol 3, pp. 6-12, Jan. 1954. 

[010] Other multiple-error correcting codes based on finite geometries, herein 
"finite geometry codes," were developed in the 1960's and 1970's, for an 
overview, see chapter 8 of Lin et al., "Error Control Coding: Fundamentals and 
Applications" Prentice Hall: Englewood Cliffs, NJ, 1983. 

[Oil] Finite geometry codes can be considered generalizations of Reed-Muller 
codes. The class of Reed-Muller codes is a sub-set of the class of Euclidean 
Geometry codes, which is a sub-set of a class of finite geometry codes. 

[012] It is possible to determine the parameters N, K and d of finite geometry 
codes. The distances of such codes are somewhat worse than the distances of 
comparable Bose, Chaudhuri, and Hocquenghen (BCH) codes. See any of the 
textbooks on error-correcting codes previously mentioned for a description of BCH 
codes. On the other hand, a relative advantage of finite geometry codes is that their 
decoders are much simpler to implement than decoders for BCH codes. 

[013] Majority-Logic, Bounded-Distance, and Maximum-Likelihood 
Decoders 

[014] Up to now, Reed-Muller codes and other finite geometry codes have 
generally been decoded by a decoding method known as "majority-logic 
decoding." Majority-logic decoding is simple to implement and is extremely fast. 
For a description of majority-logic decoding, see chapter 13 of Blahut, "Algebraic 
Codes for Data Transmission" Cambridge University Press: Cambridge, 2003. 
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[015] A majority-logic decoder is an example of a "bounded-distance decoder." 
The standard prior-art decoders for BCH codes, which are based on algebraic 
methods are also bounded-distance decoders. 

[016] A bounded-distance decoder decodes any received word to a nearest 
code- word, so long as the received word has a Hamming distance to the nearest 
code-word that is less than or equal to the bounded distance decoding radius t, 
where t = [(</ -1)/2_|. Here, the floor function |_*J indicates that the fractional part 
of x is subtracted. There can be at most one code- word within distance t or less of a 
word. Therefore, a bounded-distance decoder successfully decodes the received 
word to the transmitted code-word when the channel introduces t or fewer bit flips. 
Conversely, a bounded-distance decoder fails to decode when the received word 
has a distance from any code-word that is greater than the decoding radius t. If the 
channel introduces t or more bit flips, then the bounded-distance decoder fails to 
correctly decode the transmitted code-word. 

[017] The error-correcting performance of a bounded-distance decoder is 
normally much worse than the performance of an optimal, "maximum-likelihood" 
decoder. A maximum-likelihood decoder decodes any received word to the nearest 
code-word, whatever the Hamming distance from the received word to the nearest 
code-word. Unfortunately, the complexity of true maximum-likelihood decoders 
increases exponentially with the number of information bits k, making them 
impractical for codes with a reasonably large number of information bits. 

[018] It is desired to provide a practical decoder, which, while not quite 
optimal, performs much better than a bounded-distance decoder. Such a decoder 
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should be capable of decoding correctly many, if not all, received words that are at 
a distance greater than t from any code-word. 

[019] It is an object of the invention to provide such a decoder for Reed-Muller 
and other codes based on finite geometries. Such a decoder should greatly- 
outperform the prior-art majority-logic decoding method. 

[020] Bit-Flipping Decoders 

[021] Bit-flipping decoders have been described for low-density parity check 
(LDPC) codes, see Gallager, "Low Density Parity Check Codes," MIT Press, 1963. 
As originally described by Gallager, those decoders represent the LDPC code by a 
parity check matrix. Such a decoder stores a current state of a received word to be 
decoded as a list of elements. Each element represents one bit of the received 
word. Each element can take on a value of 0 or 1. The list of elements is initialized 
to match the bits received after passing through a channel. At each iteration in the 
Gallager bit-flipping decoding method, every element in the list is tested to see 
whether it should be flipped. An element is flipped if a sufficient number of parity 
checks including that bit indicate that the element has the wrong value. The 
decoder is continually iterated, with the elements flips decided based on the latest 
updated values of the other elements. When the bit flipping procedure terminates, 
the values of the elements are taken as the bit values of the decoded word. 

[022] Similar tri-state bit flipping decoding methods were described in the 
context of decoding LDPC codes by Richardson et al., "The Capacity of Low- 
Density Parity-Check Codes Under Message-Passing Decoding" IEEE Trans. 
Information Theory, vol. 47, num. 2, pp. 599-619, Feb. 2001, and by 
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Mitzenmacher "A Note on Low Density Parity Check Codes for Erasures and 
Errors" SRC Technical Note 1998-017, December 1998. In a tri-state bit flipping 
decoder, the elements of the word being decoded can have values of 0, 1, or ?, 
where the "?" value indicates that the exact state of the element is uncertain. 

[023] It is desired to use a bit-flipping decoding method to decode a completely 
different class of codes, namely codes based on finite geometries, rather than 
LDPC codes. 

[024] Bit-flipping decoders for LDPC codes are not bounded-distance 
decoders. In contrast to codes based on finite geometries, it is normally impractical 
to even compute the distance d, for a given LDPC code. The fact that bit-flipping 
decoders are not bounded-distance decoders means that there is no guarantee that 
is possible to decode a received word even if it is only corrupted by t or fewer bit 
flips. 

[025] On the other hand, bit-flipping decoders have the advantage of 
potentially being able to decode-words that are corrupted by more than t bit flips. 
Bit-flipping decoders have, up to now, not been used for binary block codes other 
than LDPC codes. The primary reason is that up to now, bit-flipping decoders have 
been thought to be inappropriate for anything but LDPC codes. 

[026] Although Reed-Muller codes date from 1954, and bit-flipping decoders 
for LDPC codes date from the early 1960's, nobody has previously devised a way 
to use a bit-flipping decoder for Reed-Muller codes or other finite geometry codes. 
A possible explanation for this fact is that finite geometry codes must be 
represented in a non-obvious way for bit-flipping decoders to perform well. 
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[027] Parity Check Matrix Representations of Codes 

[028] A binary block code is "linear" when the modulo 2 sum of any two code- 
words is also is a code- word. For example, the modulo 2 sum of 1110100 and 
0111010 is 1001110. A parity check matrix can represent a linear code. In fact, 
many different parity check matrices can represent a linear code. 

[029] The parity check matrix representing an (TV, k) code is defined by a 
matrix of 7's and 0's, with M rows and N columns. The N columns of the parity 
check matrix correspond to the N bits of the code. 

[030] Each row of the parity check matrix represents a parity check constraint. 
The bits involved in the constraint represented by a particular row correspond to 
the columns that have a 7 in that row. The parity check constraint forces the sum of 
the bits to be even, or equivalently 0 when summed modulo 2. For example, the 
parity check matrix 



[031] 



1110 10 0 
0 1110 10 
0 0 1110 1 



(1) 



represents three constraints 

jc, © x, © jc 3 © jc 5 = 0 (2) 

x 2 @x 3 ®x 4 @x 6 =0 (3) 

x 3 © x 4 © jc 5 © jc 7 = 0 , (4) 
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[032] where the element *, represents the value of the f 1 bit, which can be 0 or 
7, and the symbol © represents the modulo 2 sum. This parity check matrix is one 
way to represent the well-known Hamming (N=7, k=4, d=3) code. 



[033] Some of the M different constraints represented by the parity check 
matrix are linearly dependent. In the parity check matrix given in equation (1), all 
the constraints are linearly independent. Another representation for the same code 
is the parity check matrix 



1 1 1 
0 1 1 
0 0 1 



0 1 

1 0 
1 1 



0 0 



0 0 11 
10 0 1 
0 10 0 



1 
0 
1 
1 
1 



110 10 0 1 



(5) 



[034] In this parity matrix, the last four rows are obtained by adding together 
combinations of the first three rows. This means that the last four rows are merely 
redundant constraints, and there are actually only three linearly independent rows. 
For example, the fourth row can be obtained by adding together the first two rows. 

[035] The number of linearly independent rows of a matrix is sometimes called 
the "rank" of the matrix. Therefore, this matrix has a rank of three. In general, the 
rank of an M by TV parity check matrix representation of an (N, k) code is N - k. 
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[036] Finite Geometries 

[037] Like other binary linear block codes, finite geometry codes can be 
represented by a parity check matrix. Of special interest are codes based on finite 
geometries known as "Euclidean geometries." Other codes are based on finite 
geometries known as "projective geometries." Although some details are different, 
the principles are the same. For more details see chapter 8 of Lin et al., "Error 
Control Coding: Fundamentals and Applications," Prentice Hall: Englewood 
Cliffs, NJ, 1983. 

[038] A finite geometry is a mathematical construct containing a finite number 
of objects that are analogous to the points, lines, planes, and hyper-planes of 
ordinary geometry. A finite geometry has a finite number of points called "0-flats." 
Certain sets of those points are called "lines," or "1-flats." Other sets of lines are 
called "planes," or "2-flats." Sets of planes are called "3-flats," and sets of 3-flats 
are called "4-flats," and so on. In general, the sets are called "//-flats." These are 
well-known definitions in the finite geometry literature. 

[039] A Euclidean geometry EG(m, q) is characterized by two parameters, m 
and q. The "points" or "0-flats" in EG(m, q) are vectors of m different #-ary 
elements. For example if q = 2 and m = 5, then the points of EGim = 5, q = 2) are 
vectors consisting of three bits. Thus the eight points of EG(m = 3, q = 2) are 000, 
001,010, 011, 100, 101, 110, and 111. 

[040] The points of EG(m, q) can be labeled a number of different ways. In the 
example of EGim = 3, q = 2), one natural labeling is 000 = 0, 001 = 1, 010 = 2, 
011 = 3, 100 = 4, 101 =5, 110 = 6, 111 =7. 
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[041] The q-ary elements are to be added or multiplied together according to 
the rules of a Galois field GF(q). See any textbook on error-correcting codes, 
including the ones previously referenced, for a description of Galois fields. When q 
= 2, the rules of the Galois field are particularly simple. For GF(q = 2), the 
addition rules are 0+0 = 0, 0+1 = 1+0 = 1, and 1+1=0, and the multiplication rules 
are 0*0 = 0*1 = 1*0 = 0, and 1*1 = 1. 

[042] Lines in a finite Euclidean geometry are defined much as a line is defined 
in an ordinary geometry. A line in EG(m y q) is a set of points {a 0 + } , where a 0 
and a j are different points in EG(m, q), ft runs over all the g-ary elements except 
0, and addition and multiplication are defined using the arithmetic rules of GF(q). 
As an example, for the Euclidean geometry EG(m = 3,q = 2), it can be seen see 
that if a0=OlO, and aj=lll, then the corresponding line is {010, 101}. Every pair 
of points forms a line in the Euclidean geometry EG(m=3,q=2). 

[043] Higher order //-flats are defined in a similar way. For example, planes (2- 
flats) are defined as sets of points {a 0 + fla, + fi 2 a 2 } , where a 0 , a u and a 2 are linearly 

independent points in EG(m,q), and fi } and $run over all the non-zero g-ary 
elements. For example, if ##=001, «;=010, and a 7 =100, then the corresponding 
plane is {001, 011, 101, 111}. Using the labeling of points that were selected 
previously for EG(m=3,q=2), this plane has points {1,3,5,7}. 

[044] When defining codes based on Euclidean geometries, the focus is on all 
the //-flats that do not include the all-zero point. In the example using 
EG(m=3,q=2J, the all-zero point is 000 = 0, and the planes that do not include the 
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all-zero point are {1,2,4,7}, {1,2,5,6}, {1,3,4,6}, {1,3,5,7}, {2,3,4,5}, {2,3,6,7}, 
and {4,5,6,7}, where the labels are as defined for the points. 

[045] For a collection of n objects in total, and a set of those objects, one can 
define an "incidence vector" of the set with respect to the objects, which is a vector 
of n 0's and l's, where a 1 in position i indicates that that the i object is in the set. 
For example, if there are seven objects with labels 1 to 7, and one considers the set 
{1,2,5,6}, then the incidence vector for the set {1,2,5,6} with respect to the 
numbers 1 through 7 is 1100110. 

[046] In the example using EG(m=3,q=2), if the n objects are the 7 non-zero 
points, then the incidence vectors for the planes that do not contain the all-zeros 
point with respect to the non-zero points are 1101 00 1,11 00 110, 1011010, 
1010101, 0111100, 0110011, and 0001111. 

[047] Codes Based on Finite Geometries 

[048] A Euclidean Geometry EG(m,s, v) code can be defined by a parity check 

matrix whose rows are the incidence vectors of all (v+7)-flats in EG(m,2 s ) that do 

not contain the all-zeros point with respect to the non-zero point. For example, the 

EG(m=3,s=l, v=l) code can be defined by the parity check matrix 

"l 1 0 1 0 0 f 
110 0 110 
10 110 10 
10 10 10 1. 
0 11110 0 
0 110 0 11 
0 0 0 1 1 1 1 
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[049] It is important to note that the parity check matrix for a Euclidean 
geometry code defined in this way contains redundant rows. For example, although 
there are 7 rows in the parity check matrix for the EG(m=3,s=l, v=l) code above, 
only 3 of the rows are linearly independent, and the code is actually equivalent to 
the (n=7, k-4, d=3) Hamming code that was used previously as an example. 

[050] With an appropriate re-labeling of the points, a code based on a 

Euclidean Geometry can be defined in a cyclic form. For example, if the labeling 

010=1, 100=2, 011=3, 001=4, 101=5, 110=6, 111=7 is used, then the parity check 

matrix is defined as 

1 1 1 0 1 0 0~ 
0 1110 10 
0 0 1110 1 
10 0 1110 
0 10 0 111 
10 10 0 11 
110 10 0 1 

that has a circulant form making the cyclic symmetry more obvious. 

[051] As mentioned previously, Reed-Muller codes can be considered a special 
case of a code based on a Euclidean geometry. To be more precise, Reed-Muller 
codes are Euclidean geometry codes with s=l. 

[052] Codes based on other finite geometries are defined in a similar way. One 
uses the incidence vectors of high-order ju -flats with respect to the points of the 
finite geometry to define a parity check matrix for the code. 
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[053] As described above, the most commonly used prior art decoders for finite 
geometry codes are majority-logic decoders, which are bounded-distance decoders. 

[054] It is desired to provide a practical decoding method which out-performs 
majority-logic decoders for finite geometry codes. 

Summary of the Invention 

[055] The present invention provides a method for decoding an (N, k) finite 
geometry code, where N is the block-length of the code, and k is the dimension of 
the code. 

[056] During initialization, the code is first represented by a parity check 
matrix with M rows and N columns, of rank N-k. Normally, the parity check matrix 
is highly redundant. 

[057] Then, the decoder can receive words. For the purposes of this invention, 
the words received by the decoder are N bit blocks of 0's and Ts. 

[058] The initialization procedure stores the received word in an AT bit channel 
register x chame i. 

[059] The received word is also stored in an active register x active . The register 
Xaaive also stores N tri-state elements, where each element has a value of '0', T, or 
4 ?\ where a? indicates that the decoder is uncertain about the correct value. 
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[060] Finally, the initialization procedure sets initial values of two thresholds 
bjii P and 

"uncertain- 

[061] Then, the decoder begins an iterated decoding cycle. 

[062] The first step of the decoding cycle checks for a termination condition. 
The termination condition is either that the current state of the register x active is a 
code-word, or that some other condition is satisfied, such as a predetermined 
number of decoding cycles have been performed. 

[063] If the termination condition is satisfied, then the decoder outputs the 
contents of the register x active . 

[064] Otherwise, in the second step of the decoding cycle, the contents of 
register x active are used to determine updated "votes" that parity checks send to the 
elements. 

[065] In the third step of the decoding cycle, the votes that parity checks send 
to elements are used to determine a new state of the register x active . The number of 
checks that tell each allayment to set its value to a'O'or'l' is compared with the 
thresholds bji ip and b uncer1ain in order to determine the state of each element in the 
register x active . 

[066] In the fourth step of the decoding cycle, the thresholds b flip and b uncertain 
are updated according to a pre-determined schedule. 
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[067] Then, the decoding cycle returns to the check for a termination condition, 
and the cycle is repeated until the termination condition is satisfied. 

[068] If the decoder fails to output a code-word after the termination condition 
has been satisfied, then the process can be repeated any number of times, each time 
substantially increasing the number of parity checks in the parity check matrix. 

Brief Description of the Drawings 

[069] Figure 1 is a flow diagram of an initialization procedure according to the 
invention; 

[070] Figure 2 is a flow diagram of the decoding procedure according to the 
invention; and 

[071] Figure 3 is a flow diagram of the decoding procedure with restart. 

Detailed Description of the Preferred Embodiment 

[072] Defining a Parity Check Matrix Representation for the Code 

[073] Figure 1 and 2 shows steps of a method for decoding an error-correcting 
code based on a finite geometry according to our invention. The code has a block- 
length N and a dimension k. The code is based on a finite geometry with a given 
number of points N and a given number of //-flats, as known in the art. 
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[074] Figure 1 shows a one-time initialization procedure 100, and Figure 2 
shows the step of an iterative decoding procedure 200. 

[075] Initialization 

[076] More precisely, we define the code 101 by a decoding parity check 
matrix (H decode ) 111 whose M rows are incidence vectors of a sub-set of /i-flats in 
the finite geometry with respect to the points in the finite geometry. 

[077] We define the M by N parity check matrix by selecting a sub-set of the ju- 
flats in the finite geometry, and using their incidence vectors with respect to the 
points of the finite geometry as the rows of our parity check matrix 111. We call 
this parity check matrix H decode . 

[078] We can select any sub-set of the //-flats, as long as a rank of the parity 
check matrix H decode is N - k. The performance of the decoder improves when we 
choose a larger number of //-flats. However, a large number M of //-flats also 
increases the complexity of the decoder. Normally, the number of //-flats is 
substantially larger than N-k, in order for the decoder to perform well. 

[079] In the ordinary parity check matrix representation for an (N,k) LDPC 
code, the number of parity checks, i.e., rows in the matrix, is either exactly N-k or 
very close to N-k. Bit-flipping decoders work reasonably well with LDPC codes, 
even using a relatively small number of parity checks, because each parity check 
sends relatively reliable information. Parity checks in LDPC codes only involve a 
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small number of bits, which makes them relatively reliable, because it is unlikely 
that very many of the bits are in error. 

[080] Parity checks for a finite geometry code tend to be somewhat less reliable 
than those for LDPC codes, because each parity check for a finite geometry code 
tends to involve a somewhat larger number of bits than the parity checks in a 
comparable LDPC code. For this reason, it is very important that the number of 
parity checks used for finite geometry codes is substantially larger than N - k. 

[08 1] Fortunately, it is possible to select more than N-k parity checks for a 
finite geometry code, because there are normally many more than N-k of the 
appropriate //-flats. In fact, there are often so many //-flats, that when we select 
only a small fraction of the possible //-flats, we still have a sufficient number. 

[082] In the finite geometry, each //-flat contains the same number of points J, 
so each of the M rows of our parity check matrix has / ones, and N - J zeros. For a 
EG(m, q) Euclidean geometry, each //-flat contains J = q m points. 

[083] We prefer to use //-flats that are obtainable as cyclic shifts of each other. 
In that case, each column of the parity check matrix has the same number of ones. 

[084] For convenience, we also prefer to use a termination parity check matrix 
(Hterminate) 1 12, representing the same code 101, that has a smaller number of rows 
than Hdecode- This second parity check matrix H terminate , is used to check 210, in part, 
whether a termination condition is satisfied during the decoding cycles. The matrix 
Hterminate has rank N - k, and only N-k rows. 
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[085] Given the parity check matrices 1 1 1-1 12 representation of the code, and a 
received word 102, we store 120 the received word in a N-bit register x channet 121. 
We store the received word because at every iteration of the decoding cycle, the 
decoder compares votes of parity checks with channel evidence to make decisions. 

[086] We also store 130 the received word in a second Af-element register x active 
131 The register x active represents a state of the decoder for a value of each of the N 
bits of the code. Each element of the register x active can take on one of three values: 
'0', '1\ or '?'. The T value indicates that the decoder is uncertain about the 
correct value of the corresponding element. 

[087] We set 140 two thresholds, b flip and b uncertain to positive integers. The 
thresholds are set so that b flip > b mcertain . Both thresholds are normally set to large 

integers of the order of the number of ones in a column of the parity check matrix 
111. 

[088] The threshold bfl ip is normally set to be substantially larger than the 
threshold b uncertain . For example, if the number of ones in a column of the parity 
check matrix 1 1 1 is K, than a good initial setting for the thresholds would be 
b fiip » AT, and b uncertain ~9KI\0. There needs to be a gap between the two thresholds 

for the uncertain state of the elements to be used in the updating rule 240. 



18 



MERL-1494 
Yedidia et al. 

[089] Decoding 

[090] The decoder 200 repeatedly cycles 201 through the steps until the 
termination condition is satisfied in the first step 210 of the decoding cycle. When 
the termination condition is satisfied, the decoder outputs 135 the contents of the 
register x active 131 as an estimate of the transmitted code- word 209. 

[091] Termination Conditions 

[092] The first step 210 of the decoding cycle is to check whether any 
termination condition is satisfied. A number of termination conditions can be used. 
One termination condition checks whether the state of the register x active 
corresponds to a code-word. For this to be true, none of the elements of x active have 
the value '?\ and all the parity checks in the parity check matrix H termination 1 12 are 
satisfied. If the register x actiV€ contains the code- word, then the decoder terminates, 
and outputs 135 the code- word 209. 

[093] Another termination condition ensures that the decoder does not cycle 
through the decoding cycle forever. There are various possible conditions that 
would ensure that does not happen. The simplest possibility is a termination 
condition that checks whether a fixed maximum number of cycles have already 
been performed, or a fixed amount of time has elapsed. If the termination condition 
is reached and register x active does not correspond to a code- word, then the decoder 
has clearly failed to decode the received word perfectly. The decoder still outputs 
the contents of the register x activ€ because many of the individual bits are likely to 
be correct. In addition, the entire process can be restarted with a larger parity check 
matrix as shown in Figure 3. 
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[094] Thus, as shown in Figure 3, if the decoder fails to output a code- word after 
the termination condition has been satisfied, then the process can be repeated a 
number of times, each time substantially increasing 300 the number of parity 
checks in the parity check matrix 111. Such a procedure enables most of the 
received words to be decoded using a relatively small number of computations, 
while still decoding the more difficult received words by using more parity checks 
when necessary. 

[095] The limits on using such a re-starting procedure are the number of //-flats 
available for representing the code, and the time available for decoding. 

[096] Parity Check Votes 

[097] Assuming that the termination condition is not satisfied, the next step 220 
of the decoding cycle evaluates 220 "votes" that parity checks in the matrix 1 1 1 
send to the elements according to a vote rule 219 described below.. 

[098] There are M parity checks. Each parity check a corresponds to a row of 
the parity check matrix H decode 1 1 1 . For each parity check, the parity check sends 
votes to J elements. Each such element corresponds to a column of the parity check 
matrix that has the value 1 in the row a. 

[099] We say that a element "belongs" to a parity check when the parity check 
sends the element a vote. In total, there are MJ votes. 
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[0100] Each vote from a parity check a to an element i can take on three possible 
values: a '0', a '1', or an abstain. A '0' vote means that the parity check votes that 
the element takes on the value 0, a '1' vote means that the parity check votes that 
the element takes on the value 1, and a '?' vote is an abstention. That is, the parity 
check does not vote on the value of the element because the value is uncertain. 

[0101] The parity check a sends an element i a vote based on the following rule 
219. 

[0102] If any of the J - 1 elements that belong to a, other than i currently are in 
the '?' state in the register x active , then the vote is abstain. Otherwise, if an odd 
number of the other / - 1 elements are in the state ' 1 ' , then the vote is a ' 1 '.If an 
even number of the other J - 1 elements are in the state '0', then the vote is a '0'. 

[0103] The point of the rule 219 is that each parity check tries to make certain 
that an even number of associated elements are in the state T. 

[0104] Updating Elements 

[0105] After all MJ votes from parity checks to elements have been computed, 
the values of the N elements, stored in x active 13 1 are updated 230. 

[0106] Before the elements are updated, we determine a "recommendation" and 
a "strength" for that recommendation for each element from the parity check votes. 
For each element i, the number of parity checks voting that the element is a '1' is 
compared to the number of parity checks voting that the element is a '0'. 
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[0107] The value (Tor '0') that receives the most votes is considered to be the 
recommendation of the parity checks, and the magnitude of the difference between 
the votes is considered to be the strength of the recommendation. 

[0108] For example, if eight parity checks vote that the third element has the 
value '0', and six parity checks voted that the element has the value 4 1\ then the 
parity checks' recommendation for the third element is '0', with the strength 8 - 
6=2. If there is a tie in the votes for the element, then that element has no 
recommendation from the parity checks. 

[0109] We update values of each of the elements in x active according to the 
following update rule 229. 

[0110] For each of the N elements, if the recommendation of the parity checks 
agrees with the value of the corresponding element in x channel 121, or if there is no 
recommendation, then the updated value of x active 131 is equal the value of the 
corresponding element in x channe i 121. If the recommendation of the parity checks 
for a element disagrees with the value of the corresponding element in x channeh then 
there are three possibilities. If the strength of the recommendation is less than the 
threshold b uncertaim then the value of the element is set to equal the value of the 
corresponding element in x channe i. If the strength of the recommendation is greater 
than or equal to b uncertaim but less than the threshold b flip , then the value of the 
element is set to '?*. Finally, if the strength of the recommendation is greater than 
or equal to the threshold bji ip , then the value of the element is set to equal the 
recommendation of the parity checks. 
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[0111] The basic idea behind our rules is that the decoder matches each element 
to the corresponding bit in the received word, unless a sufficient number of parity 
checks vote to 'flip' the value. If an intermediate number of parity checks vote to 
flip, the element is set to the uncertain value '?'. As mentioned previously, there is 
a gap between the values of threshold b flip and threshold b uncertain for the uncertain 
value for the elements to be used. The larger the gap between the thresholds, the 
more often the uncertain value is used. 

[0112] Updating the Thresholds 

[0113] In the next step of the decoding cycle, the thresholds b flip and b uncertain are 
updated 240 according to a pre-determined schedule 239. Empirically, a best 
schedule initializes the two thresholds 141-142 to large positive values with a fixed 
difference between the values, and then decreases both thresholds by the same 
amount at each iteration of the decoding cycle, until the thresholds reach the value 
of zero, where they remain. 

[01 14] The idea behind such a decoding schedule is that initially, the decoder 
considers, most strongly, the evidence of the received word stored in x chamel 121. 
As the thresholds decrease, the votes of the parity checks become increasingly 
influential. 

[01 15] The initial effects of the parity checks merely sets elements to uncertain 
values. Eventually, using this kind of schedule, the parity checks become more 
aggressive, and flip the values of elements away from the value given by the 
received word. Thus, the decoder can be characterized as being "conservative," in 
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that it tries to accept and follow the values of the bits given in the received word, 
and only makes large changes when necessary. 

[0116] After the thresholds are updated 240, another decoding cycle 201 begins. 
The decoder finally terminates when the termination condition is satisfied. 

Effect of the Invention 

[01 17] The invention is able to decode received words that have been corrupted 
by more than / bit flips, where tis a bounded distance decoding radius. 
Experiments show that our decoder is in fact able to decode many such received 
words. 

[0118] For example, we implemented our decoder for (N = 255, k = 127, d = 
27), EG(m=4, s=2, v=l) code. The associated Euclidean geometry EG(m=4,q=4) 
has 255 non-zero points and 5355 2-flats that do not contain the zero point, each 
including 16 points. We construct a parity check matrix representing the code with 
5355 rows and 255 columns. 

[0119] The bounded distance decoding radius for this code is t=\0, which means 
that the prior art majority-logic decoders will fail to decode if the received word 
has more than 10 bits flips. Our decoder very often succeeded in successfully 
decoding such received words. For example, if we corrupt a transmitted word with 
16 bit flips, then we only have a word decoding error rate of around .00001. If we 
corrupt the transmitted word with 18 bit flips, then the word decoding error rate is 
only around .00015. If we corrupt the transmitted word with 20 bit flips, the word 
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decoding error rate is only around .0008. If we corrupt the transmitted word with 
22 bit flips, the word decoding error rate is around .004. 

[0120] These performance results indicate that our decoder out-performs 
comparable BCH codes that use bounded-distance decoders. For example, the 
parameters of a comparable BCH code are (N=255,k=123,d=39). This code has a 
slightly worse error rate than our code, and the bounded distance decoding radius 
is only 19. 

[0121] If a prior-art algebraic bounded distance decoder for the BCH code is 
used, then the decoder's failure rate is 100% when a received word has more than 
19 bit flips. Surprisingly, the failure rate of our decoder is only 0.004 % when a 
received word has as many as 22 bit flips. These are surprising results. 

[0122] These unexpected results are significant, because for codes of this block- 
length and rate, algebraic bounded-distance decoders for BCH codes had, up to 
now, given the best reported performance for decoding on the binary symmetric 
channel. 

[0123] We obtain similar experimental results for other codes based on finite 
geometries, including the (N=511,k=256,d=31) Reed-Muller code. 

[0124] Although the invention has been described by way of examples of 
preferred embodiments, it is to be understood that various other adaptations and 
modifications may be made within the spirit and scope of the invention. Therefore, 
it is the object of the appended claims to cover all such variations and 
modifications as come within the true spirit and scope of the invention. 
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